AITopics | quantization noise

quantization noise

A Bio-inspired Redundant Sensing Architecture

Anh Tuan Nguyen, Jian Xu, Zhi Yang

Neural Information Processing Systems Apr-21-2026, 15:26:47 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, mismatch error, (18 more...)

Country: North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)

Genre: Research Report (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning (0.66)

Post training 4-bit quantization of convolutional networks for rapid-deployment

Ron Banner, Yury Nahshan, Daniel Soudry

Neural Information Processing Systems Feb-13-2026, 22:41:38 GMT

Neural Information Processing Systems http://nips.cc/

activation, quantization, weight and activation, (14 more...)

Country:

North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Middle East > Israel (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

9431c87f273e507e6040fcb07dcb4509-Paper.pdf

Neural Information Processing Systems Feb-10-2026

activation, quantization, sparq, (15 more...)

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Israel > Haifa District > Haifa (0.04)

Industry: Information Technology (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Supplementary Material for PTQD: Accurate Post-Training Quantization for Diffusion Models

Yefei He

Neural Information Processing Systems Feb-9-2026, 10:16:09 GMT

ZIP Lab, Monash University, Australia

We organize our supplementary material as follows: In section A, we provide a comprehensive explanation of extending PTQD to DDIM [10]. In section B, we show the statistical analysis of quantization noise. In section D, we provide additional visualization results on the ImageNet and LSUN datasets. We first perform statistical tests to verify whether the residual quantization noise adheres to a Gaussian distribution. This test is based on D'Agostino and Pearson's test. In Figure B, we present the variance of the residual uncorrelated quantization noise.

artificial intelligence, machine learning, quantization noise, (12 more...)

Country:

Oceania > Australia (0.24)
Asia > China (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

PTQD: Accurate Post-Training Quantization for Diffusion Models

Yefei He

Neural Information Processing Systems Feb-9-2026, 10:16:06 GMT

artificial intelligence, diffusion model, machine learning, (18 more...)

Country:

Asia > China (0.04)
Oceania > Australia (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

PTQD: Accurate Post-Training Quantization for Diffusion Models

Neural Information Processing Systems Dec-24-2025, 08:42:19 GMT

Diffusion models have recently dominated image synthesis and other related generative tasks. However, the iterative denoising process is expensive in computations at inference time, making diffusion models less practical for low-latency and scalable real-world applications. Post-training quantization of diffusion models can significantly reduce the model size and accelerate the sampling process without requiring any re-training. Nonetheless, applying existing post-training quantization methods directly to low-bit diffusion models can significantly impair the quality of generated samples. Specifically, for each denoising step, quantization noise leads to deviations in the estimated mean and mismatches with the predetermined variance schedule. Moreover, as the sampling process proceeds, the quantization noise may accumulate, resulting in a low signal-to-noise ratio (SNR) during the later denoising steps. To address these challenges, we propose a unified formulation for the quantization noise and diffusion perturbed noise in the quantized denoising process.
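The link between bit-width, quantization noise, and SNR described in this abstract can be illustrated with a minimal sketch. It is not PTQD's method: it assumes a plain symmetric uniform quantizer applied once to Gaussian test data, and the names `quantize_uniform`, `snr_db`, and `snr_by_bits` are hypothetical helpers introduced only for illustration.

```python
import math
import random

def quantize_uniform(values, num_bits):
    """Symmetric uniform quantizer (an assumed, generic scheme):
    round each value to the nearest point on a signed integer grid."""
    levels = 2 ** (num_bits - 1) - 1          # e.g. 7 positive levels for 4 bits
    max_abs = max(abs(v) for v in values)
    scale = max_abs / levels if max_abs > 0 else 1.0
    return [max(-levels, min(levels, round(v / scale))) * scale for v in values]

def snr_db(signal, noisy):
    """Signal-to-noise ratio in decibels, treating (noisy - signal) as noise."""
    signal_power = sum(s * s for s in signal) / len(signal)
    noise_power = sum((n - s) ** 2 for s, n in zip(signal, noisy)) / len(signal)
    if noise_power == 0:
        return float("inf")
    return 10.0 * math.log10(signal_power / noise_power)

random.seed(0)
x = [random.gauss(0.0, 1.0) for _ in range(10_000)]

# SNR of a single quantization pass at decreasing bit-widths.
snr_by_bits = {bits: snr_db(x, quantize_uniform(x, bits)) for bits in (8, 4, 2)}
```

Under these assumptions the SNR falls as the bit-width shrinks. The sketch covers only one quantization pass; the accumulation of noise across denoising steps that the abstract describes is not modeled here.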

accurate post-training quantization, diffusion model, name change, (9 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

GDNSQ: Gradual Differentiable Noise Scale Quantization for Low-bit Neural Networks

Salishev, Sergey, Akhremchik, Ian

arXiv.org Artificial Intelligence Nov-12-2025

Quantized neural networks can be viewed as a chain of noisy channels, where rounding in each layer reduces capacity as bit-width shrinks; the floating-point (FP) checkpoint sets the maximum input rate. We track capacity dynamics as the average bit-width decreases and identify resulting quantization bottlenecks by casting fine-tuning as a smooth, constrained optimization problem. Our approach employs a fully differentiable Straight-Through Estimator (STE) with learnable bit-width, noise scale and clamp bounds, and enforces a target bit-width via an exterior-point penalty; mild metric smoothing (via distillation) stabilizes training. Despite its simplicity, the method attains competitive accuracy down to the extreme W1A1 setting while retaining the efficiency of STE.
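The idea of enforcing a target bit-width via an exterior-point penalty can be sketched as follows. The abstract does not give the penalty's form, so this assumes a generic quadratic exterior penalty on the mean bit-width; `exterior_penalty`, `penalty_gradient`, the learning rate, and the example bit-widths are all hypothetical. A real fine-tuning run would add this term to the task loss, which is omitted here.

```python
def exterior_penalty(bit_widths, target_bits, weight=1.0):
    """Quadratic exterior-point penalty (assumed form): zero while the mean
    bit-width is at or below the target, quadratic in the excess otherwise."""
    mean_bits = sum(bit_widths) / len(bit_widths)
    violation = max(0.0, mean_bits - target_bits)
    return weight * violation ** 2

def penalty_gradient(bit_widths, target_bits, weight=1.0):
    """Gradient of the penalty with respect to each per-layer bit-width."""
    n = len(bit_widths)
    mean_bits = sum(bit_widths) / n
    violation = max(0.0, mean_bits - target_bits)
    return [2.0 * weight * violation / n] * n

# Hypothetical per-layer continuous bit-widths and target.
bits = [8.0, 6.0, 4.0]
target = 4.0
initial_penalty = exterior_penalty(bits, target)

# Plain gradient descent on the penalty alone: the mean bit-width
# approaches the target from the infeasible (too-large) side.
learning_rate = 0.5
for _ in range(100):
    grad = penalty_gradient(bits, target)
    bits = [b - learning_rate * g for b, g in zip(bits, grad)]

final_mean_bits = sum(bits) / len(bits)
```

The penalty is inactive inside the feasible region, so layers are only pushed toward lower precision while the average exceeds the target.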

artificial intelligence, machine learning, quantization, (17 more...)

arXiv.org Artificial Intelligence

2508.14004

Country: North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Huang, Wei, Ge, Yi, Yang, Shuai, Xiao, Yicheng, Mao, Huizi, Lin, Yujun, Ye, Hanrong, Liu, Sifei, Cheung, Ka Chun, Yin, Hongxu, Lu, Yao, Qi, Xiaojuan, Han, Song, Chen, Yukang

arXiv.org Artificial Intelligence Oct-14-2025

We propose QeRL, a Quantization-enhanced Reinforcement Learning framework for large language models (LLMs). While RL is essential for LLMs' reasoning capabilities, it is resource-intensive, requiring substantial GPU memory and long rollout durations. Beyond efficiency, our findings show that quantization noise increases policy entropy, enhancing exploration, and enabling the discovery of better strategies during RL. To further optimize exploration, QeRL introduces an Adaptive Quantization Noise (AQN) mechanism, which dynamically adjusts noise during training. Experiments demonstrate that QeRL delivers over 1.5× speedup in the rollout phase. Moreover, this is the first framework to enable RL training of a 32B LLM on a single H100 80GB GPU, while delivering overall speedups for RL training. It also achieves faster reward growth and higher final accuracy than 16-bit LoRA and QLoRA, while matching the performance of full-parameter fine-tuning on mathematical benchmarks such as GSM8K (90.8%) and MATH 500 (77.4%) in the 7B model. These results establish QeRL as an efficient and effective framework for RL training in LLMs.

Figure 1: Rollout speedup and accuracy of QeRL on Qwen2.5-7B-Instruct. QeRL achieves faster RL rollout and end-to-end training speeds (batch=8), while delivering performance superior to vanilla LoRA and QLoRA, also comparable to full-parameter RL on mathematical benchmarks.

The ability to perform multi-step reasoning is critical for large language models (LLMs) to handle complex tasks, from theoretical problem solving to practical decision making (Sui et al., 2025; Xu et al., 2025; Chu et al., 2025; Yang et al., 2021). Supervised fine-tuning (SFT) is a common method to improve reasoning by training models to replicate explicit reasoning steps (Huang et al., 2024d; Min et al., 2024). In contrast, reinforcement learning (RL) uses verifiable reward signals to support adaptive learning, allowing models to explore diverse reasoning traces and identify more robust solutions (Lambert et al., 2024; DeepSeek-AI, 2025; Chen et al., 2025a).

AQN dynamically adjusts quantization noise with an exponential scheduler, enhancing exploration.
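An exponential scheduler for a noise scale might look like the sketch below. The excerpt says only that AQN uses an exponential scheduler, so the decaying direction, the start and end values, the number of stages, and the name `exponential_noise_schedule` are all assumptions made for illustration, not QeRL's actual settings.

```python
def exponential_noise_schedule(sigma_start, sigma_end, num_stages):
    """Geometric interpolation between two noise scales (assumed form):
    consecutive stages differ by a constant multiplicative factor."""
    if num_stages < 2:
        return [sigma_start]
    ratio = sigma_end / sigma_start
    return [
        sigma_start * ratio ** (stage / (num_stages - 1))
        for stage in range(num_stages)
    ]

# Hypothetical values: decay the noise scale from 1e-2 to 1e-4 over 5 stages.
schedule = exponential_noise_schedule(1e-2, 1e-4, 5)
```

With these hypothetical values, each stage scales the noise by the same factor, so exploration noise is largest early in training and tapers off smoothly.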

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2510.11696

Genre: Research Report > New Finding (1.00)

Technology: